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Abstract 

This  paper  shows  that  growth  models  featuring  the  representative  consumer 
(RC)  assumption  can  generate  rich  dynamics  for  the  cross-sections  of  con- 
simiption,  wealth  and  income.  We  consider  a  class  of  growth  models  with 
three  somrces  of  consimier  heterogeneity:  initial  wealth,  non-acquired  skills 
and  taste  for  consumption-smoothing.  Despite  this  heterogeneity,  the  mod- 
els in  this  class  admit  a  RC  and,  consequently,  exhibit  aggregate  dynamics 
that  are  indistingmshable  from  those  of  the  standard  homogeneous-consiuner 
models.  One  could  therefore  interpret  our  research  as  an  attempt  to  make 
expUcit  the  distributive  dynamics  that  imderUe  the  popular  RC  models. 

We  examine  the  behavior  of  consmners'  relative  consimaption,  wealth  and 
income,  and  derive  cross-sectional  equations  that  show  how  these  quantities 
(at  any  date)  are  related  to  both  average  variables  and  the  consumer's  in- 
dividual characteristics.  Using  these  equations,  we  study  the  distributive 
dynamics  of  the  Ramsey  and  the  Unear  growth  models.  While  the  former 
is  consistent  with  the  existence  of  alternating  periods  of  conditional  conver- 
gence and  divergence  in  a  cross-section  of  individual  incomes,  the  latter  is 
not.  Using  data  from  the  Panel  Study  of  Income  Dynamics  (PSID),  we  find 
evidence  of  conditional  convergence  in  the  1970s,  and  divergence  thereafter. 
This  finding  is  therefore  inconsistent  with  the  hnear  growth  model,  but  not 
with  some  versions  of  the  Ramsey  model. 


1     Introduction 

The  widespread  use  of  the  representative  consumer  (RC)  assumption  in  the 
theory  of  economic  growth  has  lead  m£iny  to  beUeve  this  hne  of  research 
has  httle  to  say  about  distributive  issues.  This  paper  argues  that  quite  the 
opposite  is  true.  We  consider  a  class  of  growth  models  with  three  sources 
of  consmner  heterogeneity:  initial  wealth,  non-acquired  skiUs  and  taste  for 
consmnption-  smoothing.  Despite  this  heterogeneity,  the  models  in  this  class 
admit  a  RC  and,  consequently,  exhibit  aggregate  dynamics  that  axe  indis- 
tinguishable from  those  of  the  standard  homogeneous-consvuner  models.  Yet 
this  does  not  mean  that  the  models  we  study  do  not  have  interesting  distribu- 
tive impUcations.  On  the  contrary,  we  find  that  they  generate  rich  dynamics 
for  the  cross-sections  of  consimiption,  wealth  and  income.  One  could  there- 
fore interpret  our  research  as  an  attempt  to  malce  exphcit  the  distributive 
dynamics  that  imderhe  the  popular  RC  models. 

The  RC  is  a  fictional  consumer  whose  utihty  maximization  problem  when 
facing  aggregate  resource  constraints  generates  the  economy's  aggregate  de- 
mand functions.  The  RC  assumption  does  not  necessarily  rule  out  consumer 
heterogeneity,  but  only  requires  that  potential  sources  of  consumer  hetero- 
geneity have  sufficient  structure  to  ensmre  that  the  smn  of  all  consumers 
behaves  as  if  it  were  a  single  consimier.  For  instance,  if  we  are  not  wilhng  to 
restrict  the  set  of  admissible  wealth  distributions,  a  necessary  and  sufficient 
condition  for  a  model  to  admit  a  RC  is  that  all  consimiers'  preferences  can  be 
represented  by  indirect  utihty  functions  of  the  Gorman  form  (of  which  special 
cases  axe  homothetic  and  quasilineax  utihty  functions).^  If  we  are  wilhng  to 
impose  enough  restrictions  on  the  set  of  admissible  wealth  distributions,  it  is 
then  possible  to  enlarge  the  class  of  preferences  that  lead  to  the  RC  property. 

It  is  difficult  to  overemphasize  the  analytical  convenience  of  the  RC  as- 
sumption. In  economies  with  heterogeneous  agents  the  behavior  of  average 
quantities  depends,  in  general,  on  how  these  averages  axe  distributed  across 
consumers.  The  RC  assmnption  restricts  the  behavior  of  average  quantities 
to  depend  exclusively  on  these  same  averages,  and  not  on  their  distribution. 


^Let  Wj  be  the  wealth  of  consumer  j,  and  let  p  be  a  given  price  vector.  The  Gorman 
indirect  utihty  function  is  V(p,  Wj)  =  aj(p)  +/3(p)  ■  Wj  (note  that  £ill  consumers  must  have 
the  same  wealth  coefficient).  See  Mas-Colell,  Whinston  and  Green  (1995,  chapter  4)  for 
a  review  of  aggregation  results,  including  a  discussion  of  the  concept  of  a  representative 
consumer. 


Since  keeping  track  of  averages  is  always  simpler  than  following  distribu- 
tion functions,  this  first  property  of  RC  models  constitutes  a  major  source 
of  analytical  tractabihty.  Moreover,  since  disaggregated  data  sets  are  rel- 
atively scarce,  it  is  convenient  to  work  with  theories  that  deliver  testable 
impUcations  in  terms  of  average  variables  and  not  on  their  whole  distribu- 
tion. But  this  is  not  all.  We  know  that,  in  general,  average  demand  functions 
can  take  basically  any  form  (Sonnenschein-Mantel-Debreu  theorem).  In  RC 
models  average  demands  are  generated  as  a  solution  of  an  individual  con- 
sumer problem  and,  as  a  result,  they  satisfy  all  the  properties  of  individual 
demand  functions.  In  appUed  work  these  properties  are  extremely  useful, 
since  they  allow  us  to  derive  testable  impUcations  from  theoretical  models. 

Indeed,  since  much  of  existing  growth  theory  is  based  on  the  RC  assumpn 
tion,  it  is  possible  to  imderstand  this  paper  as  an  attempt  to  expand  the 
range  of  applications  of  existing  growth  models  to  include,  in  addition  to 
the  usual  predictions  regarding  average  quantities,  also  predictions  regarding 
how  these  quantities  axe  distributed  in  a  cross-section  of  consumers.  Our 
paper  is  therefore  closely  related  to  StigUtz  (1969)  and  Chatterjee  (1994). 
These  papers  study  the  evolution  of  the  distribution  of  wealth  in  the  Solow 
and  in  the  Ramsey  model,  respectively.  Our  work  differs  from  theirs  in  three 
dimensions.  First,  we  introduce  additional  sources  of  consmner  heterogene- 
ity. Second,  our  results  are  more  general,  since  they  apply  to  any  RC  growth 
model.  Finally  and  most  important,  we  derive  equations  that  explicitly  relate 
the  evolution  of  the  cross-sections  of  consumption,  income  and  wealth  to  the 
evolution  of  average  quantities  and  prices  and  the  individual  characteristics 
of  the  consvuners. 

We  study  a  class  of  RC  growth  models  with  three  sources  of  consimier 
heterogeneity:  initial  wealth,  non-acquired  skills  and  taste  for  consumption- 
smoothing.  In  particular,  we  examine  the  behavior  of  consumers'  relative 
consmnption,  wealth  and  income,  i.e.  the  ratio  of  a  consumer's  consumption, 
wealth  and  income  to  the  economy's  corresponding  average.  We  derive  cross- 
sectional  equations  that  show  how  these  quantities  (at  any  date)  are  related 
to  both  average  variables  and  the  consimaer's  individual  characteristics.  Since 
average  variables  are  independent  of  the  distribution  of  individual  character- 
istics by  virtue  of  the  RC  assumption,  one  can  use  a  three  step  algorithm  to 
extract  empirical  impUcations  for  the  cross-sections  of  consimiption,  wealth 
and  income:  (i)  Solve  one's  preferred  RC  model  and  characterize  the  equi- 
Ubrium  time-path  of  average  quantities  and  prices;  (ii)  Make  assumptions 


regarding  the  distribution  of  individual  charzicteristics  (i.e.  initial  wealth, 
skills  and  tastes);  (iii)  With  the  information  obtained  in  steps  (i)  and  (ii) 
at  hand,  use  the  appropiate  cross-sectional  equations  to  obtain  testable  re- 
strictions regarding  the  evolution  of  any  initial  cross-section  of  consumptions, 
wealth  and/or  real  incomes. 

We  illustrate  the  use  of  this  algorithm  to  study  distributive  dynamics 
in  two  popular  RC  growth  models.  In  our  first  example  we  show  that,  un- 
der  a  standard  representation  of  technology,  the  Ramsey  model  generates  a 
Kuznets  curve,  that  is,  an  inverted-U  shaped  relationship  between  inequaUty 
measures  and  the  level  of  development  of  a  country.  In  the  second  example, 
we  completely  characterize  the  dynamics  of  distribution  in  a  hnear  growth 
model.  This  model  is  extremely  simple  and  permits  closed-form  solutions 
that  help  us  develop  intuition  about  the  forces  that  determine  the  evolution 
of  a  wealth  distribution  profile.  Both  the  Ramsey  and  hnear  models  are 
consistent  with  the  existence  of  conditional  convergence  and  divergence  in  a 
cross-section  of  individual  incomes.  One  empiricaJ  feature  that  distinguishes 
these  models  is  the  possibihty  of  reversals  in  the  patterns  of  convergence. 
While  the  Ramsey  model  can  accoimt  for  alternating  periods  of  convergence 
and  divergence,  the  hnear  model  cannot. 

Our  theoretical  results  allow  us  to  formulate,  for  a  cross-section  of  individ- 
ual incomes,  an  empirical  specification  that  is  analogous  to  the  cross-country 
or  cross-regional  "growth  regressions"  that  have  been  extensively  studied  in 
the  last  decade.^  We  use  data  firom  the  Panel  Study  of  Income  Dynamics 
(PSID)  to  characterize  the  dynamic  behavior  of  a  cross-section  of  family  in- 
comes in  the  1970's  and  1980's.  We  find  evidence  of  conditional  convergence 
in  the  former  decade,  and  divergence  in  the  latter.  Specifically,  our  results 
suggest  that  in  the  1970s  the  income  distribution  was  converging  towards  the 
distribution  of  skills  and/or  tastes,  while  in  the  1980s  it  diverged  firom  the 
distribution  of  skills  and/or  tastes  and  initial  income  differences  were  accen- 
tuated. As  already  mentioned,  this  reversal  in  the  patterns  of  convergence  is 
inconsistent  with  the  linear  growth  model,  but  not  with  the  Ramsey  model. 

That  the  RC  model  can  be  used  to  study  many  distributive  issues  does 
not  mean  that  all  the  important  questions  concerning  the  distribution  of 


^Examples  of  cross-country  studies  are  Baumol  (1986),  De  Long  (1988),  Baxro  (1991), 
Mankiw,  Romer  and  Weil  (1992),  Knight,  Loayza  and  Villanueva  (1993)  Barro  and  Sala-i- 
Martin  (1995)  and  Caselli,  Esquivel  and  Lefort  (1995).  Examples  of  cross-regional  studies 
axe  Barro  and  Sala-  i-Martin  (1991,  1992,  1995). 


income  can  be  addressed  within  this  framework.  For  example,  there  has  re- 
cently been  substantial  interest  in  the  question  of  how,  as  a  result  of  poUtico- 
institutional  factors^  and  labor^  and/or  financial^  market  imperfections,  al- 
ternative intitial  income  distribution  profiles  map  into  subsequent  income 
levels  and  growth  rates.  This  literature  is  summarized  in  Benabou  (1996). 
By  the  very  nature  of  the  problem  these  papers  analyze  (i.e.  the  effects  of 
lump-siun  redistribution  poUcies  on  income  levels  and  growth  rates),  the  use 
of  RC  models  is  ruled  out  from  the  start.  The  reason  is  that  the  RC  property 
implies  that  limip-smn  redistributions  do  not  affect  the  behavior  of  average 
quantities.^ 

The  paper  is  organized  as  follows:  Section  two  develops  the  basic  theory. 
Section  three  apphes  the  theory  to  the  Ramsey  and  linear  growth  models. 
Section  four  presents  some  empirical  results.  Section  five  concludes. 


^See,  for  instance,  Perotti  (1993),  Alesina  and  Perotti  (1994),  Alesina  and  Rodrik  (1994) 
and  Persson  and  Tabellini  (1994). 

^See,  for  instance,  Acemoglu  (1995). 

^See,  for  instance,  Loury  (1981),  Becker  and  Tomes  (1986),  Banerjee  and  Newman 
(1991  and  1993),  Galor  and  Zeira  (1993),  Behrman,  PoUack  and  Taubman  (1995)  and 
Piketty  (1996). 

®This  is  not  the  same  as  to  say  that  in  the  RC  models  any  redistribution  poUcy  does 
not  affect  aggregate  variables.  This  second  statment  is  false. 


2     Heterogeneity  in  the  RC  Model 

This  section  presents  a  model  with  heterogeneous  consumers,  in  which  the 
RC  property  emerges  as  an  aggregation  result.  We  characterize  the  behavior 
of  the  cross-sections  of  consumption,  wealth  and  income,  as  a  function  of  av- 
erage quantities  and  prices  and  the  distributions  of  individueil  characteristics. 

2.1     A  RC  Model 

Consider  a  one-good  economy  with  heterogeneous  consimaers,  indexed  by 
j  =  1,2,...,  J.  Time  is  continuous  and  there  is  no  uncertainty.  Let  Cj{t)  E  5R+ 
denote  the  constunption  of  agent  j  at  date  t,  and  c{t)  =  {ci{t),  ...,cj{t))  €  9?;^ 
be  the  economy's  vector  of  consmnptions.  To  simplify  notation,  we  omit  the 
time  indexes  when  this  is  not  confusing.  Consumers  have  preferences  that 
can  be  represented  by  a  Stone-Geary  utihty  function: 


/ 


i2i±M^JL . e-p-^ . dt  ie>o,p>o)  (1) 


where  Pj  e  5R  and  P  =  {Pi,...,Pj)  e^-^  .  We  do  not  impose  any  restriction  on 
the  vector  y0.  As  a  result,  a  first  sovurce  of  consvuner  heterogeneity  is  variation 
in  tastes.  As  we  shall  see,  the  Stone-Geary  utihty  function  is  a  specially 
convenient  paxametrization  of  preferences,  since  it  gives  rise  to  consumption 
demands  that  are  linear  in  wealth.  Despite  this  simphcity,  we  are  allowing 
the  elasticity  of  intertemporal  substitution  to  vary  both  in  a  cross-  section  of 
consmners  emd  in  a  time  series  of  a  single  consimier.  This  elasticity  is  given  by 

-^ -.  If  /5j  <  0  {Pj  >  0),  then  the  intertemporal  elasticity  of  substitution 

is  increasing  (decreasing)  in  the  level  of  consumption  Cj,  and  approaches  9~^ 
as  Cj  — >  oo.  If  ySj  =  0  ,  the  intertemporal  elasticity  of  substitution  is  constant 
and  equal  to  ^~^ .  Also,  for  a  given  value  of  consumption,  the  intertertemporal 
elasticity  of  substitution  increases  with  /3j. 

Consumers  derive  income  from  their  ownership  of  assets  and  the  labor 
they  provide.  A  second  source  of  constimer  heterogeneity  is  given  by  their 
initial  asset  holdings.  Let  aj{t)  €  3?  be  the  stock  of  assets  of  consumer  j  at 
date  t,  and  d{t)  =  (ai(t),  ...,aj{t))  e  3?"^  be  the  economy's  vector  (distribu- 
tion) of  asset  holdings.  Throughout  the  paper  we  interpret  the  consimier's 


stock  of  assets  in  a  broaxi  maimer,  so  as  to  incorporate  both  financial  wealth 
and  acquired  skills  or  education.  A  third  source  of  consimier  heterogeneity  is 
variation  in  labor  productivity  or  non-acqiiired  skills.  We  define  ttj  €  3J+  as 
the  labor  productivity  of  agent  j,  and  tt  =  (tti,  ...,  ttj)  G  3?:[  as  the  economy's 
vector  (distribution)  of  labor  productivity  parameters. 

Let  r{t)  e  5R+  and  w{t)  €  3?+  denote  the  rate  of  return  on  assets  and  the 
wage  rate  measured  in  efl&ciency  units,  at  date  t.  We  assume  that  r  and  w 
are  smooth  functions  of  time  with,  at  most,  a  finite  number  of  jumps.  Using 
this  notation,  we  can  write  consumer  fs  flow  budget  constraint  as  follows: 

dj  =  r  •  aj  +  w  •  iTj  —  Cj  (2) 

Since  we  interpret  assets  holdings  in  a  broad  manner,  it  would  be  inapv- 
propriate  to  equate  the  quantity  w  •  ttj  with  the  consiuner's  labor  income. 
The  latter  also  contains  the  return  to  himaan  capital  which  is  included  in  the 
quantity  r  ■  Uj. 

The  consimier 's  problem  of  maximizing  the  objective  function  (1)  subject 
to  the  flow  budget  constraint  (2)  hsis  solution:^ 


Cj  =  mi  •  Uj  +  m2  •  TTj  +  TTls  •  Pj 

where  mi,  m2  and  ma  are  defined  as  follows: 


(3) 


mi 


I     /    /i(r(l-e)-p).d«  \ 

=     \Je^  .dr 


-1 


/ 


^There  are  two  implicit  assumptions  in  this  result.  First,  consumers  are  free  to  borrow 
against  futiu-e  labor  income.  Second,  Ponzi  games  are  not  allowed.  The  first  assumption 
ensures  that  the  flow  budget  constraint  operates  at  all  times.    The  second  assumption 


implies  that  the  usual  transversaUty  condition  applies  (i.e.   lim  a^  •  {cj  +  /?, )     •  e"''*  =  0) 


t— >oo 


Jointly,  these  asstmiptions  allow  us  to  integrate  the  Euler  equation: 


Ci  =  ^-('--p)-(cj+^j) 

and  the  flow  budget  constraint  in  (2)  and,  combining  these  two  integrals,  obtain  the 
consumption  function  in  (3).  See,  for  instance,  Blanchard  and  Fisher  (1989,  chapter  2) 
and  Barro  and  Sala-i-Martin  (1995,  chapter  2). 


7     -/^■'^" 

1712    =    TUi  •     w  •  e    '        •  dr 


t 

oo 


-jr-dv 


TUs    =    — 1  +  mi  •  /  e   '        •  dr 


Note  that  the  consvimption  function  is  Uneax  in  Uj,  -kj  and  ^j.^  This  prop- 
erty of  the  chosen  representation  of  preferences,  combined  with  the  Unearity 
of  the  budget  constraint,  is  crucial  in  allowing  the  nice  aggregation  properties 
of  this  model. 

This  economy  admits  a  RC,  in  the  usual  sense  that  the  sum  of  all  con- 
sinners  behaves  exactly  as  if  the  economy  contained  a  single  consumer  with 
average  asset  holdings,  labor  productivity  and  taste  parameter.  To  see  this, 

let  c  =  -r  •  X;  Cj  and  a  =  —  •'^aj  be  the  average  per  capita  consmnption 
J     j  J     j 

and  assets.   Also,  let  the  average  value  of  the  taste  and  labor-productivity 

1  1    -^ 

parameters  he  P  =  —  •J2Pj  and  tt  =  —  ^  tTj  .  Summing  (2)  and  (3)  over  all 

consmners  and  dividing  by  J,  we  find  that: 

a  =  r  ■  a  +  w  ■  TT  —  c  (4) 

c  =  mi  •  a  -h  m2  •  TT  -t-  ma  •  /3  (5) 

These  equations  are  those  of  the  average  consumer.  This  imphes  that  all 
models  satisfying  our  assumptions  have  predictions  for  average  quantities  and 
prices  that  are  indistinguishable  from  those  of  the  homogeneoiis-consumer 
models.  Within  this  class  of  models,  if  one  is  only  interested  in  the  behav- 
ior of  averages,  there  is  no  loss  of  generality  in  assvuning  that  all  consumers 
are  identical,  and  readily  proceed  to  a  discussion  of  firms  and  technology. 
However,  one  should  not  be  misled  into  beUeving  that  RC  models  do  not 


A  consumer  with  Stone-Geary  preferences  might  choose  a  negative  consimiption  in 
some  periods.  Since  we  cannot  find  any  nieaningful  economic  interpretation  of  a  negative 
consimiption  rate,  we  rule  out  this  possibiUty  in  what  follows.  One  can  read  this  assump- 
tion as  a  restriction  on  the  distribution  of  individual  characteristics,  (i.e.  a(0),  ^  and/or 
tt)  or,  alternatively,  as  a  restriction  on  the  permissible  functions  for  factor  prices  (i.e.  r 
and  w). 


have  interesting  implications  for  the  cross-section  of  consmners.  On  the  con- 
trary, this  class  of  models  is  consistent  with  a  rich  variety  of  cross-sectional 
behavior. 

2.2     The  Cross-section  of  Consumption  Rates 

Define  cf  =  —  and  let  c^{t)  =  (cf  (f),  ...,c^(t))  be  the  vector  of  relative 

consmnptions.  We  refer  to  the  vector  c^  as  the  cross-section  of  consumptions. 
Equations  (3)  and  (5)  imply  that  the  initial  cross-section  of  consvunptions 
is  a  hneax  combination  of  the  distribution  of  all  three  individual  character- 
istics of  the  consumer  (i.e.  initial  wealth,  skills  and  tastes).  That  is,  the  jth 
element  of  c^(0)  is: 

^fl/QX  ^  mi  •  aj{0)  +  m2  •  ttj  +  rm  •  (3j 

^^    ^  TUi-  a(0)  -1-  m2  •  TT  -I-  7713  •  /? 

Diflterentiating  cf  with  respect  to  time  (use  the  Euler  equation  of  each 
consimier  in  footnote  7),  we  find  that  the  jth  element  of  c^  has  the  following 
law  of  motion: 

or,  alternatively,  we  have  that: 

where  Acf{t)  =  cf{t)  -  cf  (0).^  Equation  (7)  or  its  integral  version  (8) 
completely  characterize  the  behavior  of  the  cross-section  of  consimiptions 

^To  obtain  equation  (8),  integrate  (7)  to  find  that: 

cf{t)  =  cf{0).e    0  +p..j!—£.e    .  .dr 

0 

and  simplify  this  expression,  using  the  fact  that,  integrating  the  consumer's  Euler  equation 


from  any  initiaJ  cross-section  as  a  frinction  of  average  quantities,  prices  and 
the  distribution  of  taste  parameters,  p. 

To  understand  the  intuition  behind  equations  (7)  and  (8)  one  should  re- 
member the  trade-off  that  consumers  face  when  deciding  how  much  to  save. 
Consider  a  growing  economy,  i.e.  r  >  p.  On  the  one  hand,  since  the  rate 
of  retvirn  exceeds  the  rate  of  time  preference  consiuners  would  like  to  post- 
pone consumption  and  take  advantage  of  high  rates  of  return.  On  the  other 
hand,  since  the  economy  is  growing,  consumers  would  Uke  to  advance  con- 
sumption in  order  to  smooth  it  over  time.  The  strenght  of  this  consiunption- 
smoothing  desire  is  measured  by  the  intertemporal  elasticity  of  substitution, 

i.e.  -  +      ''    .  The  larger  this  elasticity,  the  less  the  consumer  cares  about 
u       6  •  Cj 

consimiption-smoothing  and,  as  a  result,  the  more  the  consmner  saves  and 
takes  advantage  of  growth  opportunities.  Since  all  consumers  face  the  same 
rate  of  return  and  exhibit  the  same  rate  of  time  preference,  relative  consimip>- 
tion  positions  improve  (worsen)  for  those  consumers  that  have  an  elasticity 

of  intertemporal  substitution  leirger  (smaller)  than  average,  i.e.  —  >  —  .  ^° 

Cj         c 

Let  7  =  1—  lim  e   "  .  This  Umit  is  directly  related  to  the  growth 

f— »oo 

path  of  the  economy.  For  instance,  in  an  economy  with  positive  long-rim 
growth,  the  rate  of  return  is  bounded  above  the  rate  of  time  preference 
asymptotically,  and  7=1.  In  an  economy  without  long-run  growth,  if  the 
starting  point  is  the  steady-state,  the  rate  of  return  equals  the  rate  of  time 
preference  and  7  =  0.  If  the  economy  starts  below  (above)  its  steady-state, 
then  the  rate  of  return  exceeds  (falls  short  of)  the  rate  of  time  preference, 
and  decUnes  (increases)  towards  it  monotonicaUy.  In  this  case,  0  <  7  < 
1  (0  <  7).    Using  this  definition,  we  can  write  the  Umiting  distribution  or 


(see  footnote  7): 


c(t)+/3=(c(0)+/3)-e    0 


^°Note  that  in  an  economy  with  negative  growth  just  the  opposite  is  true.  Consumers 
with  a  high  elasticity  of  intertemporal  substitution  will  save  very  little  and,  as  a  result, 
find  that  their  relative  consumption  positions  worsen. 


cross-section  of  consumptions  as  follows: 

Hm  cf  W  =  c;(0).  (l-fjjl)  +  ft.  ^  (9) 

The  limiting  distribution  of  consumptions  is  not  degenerate,  as  it  is  com- 
mon in  models  that  allow  for  cross-sectional  variation  in  tastes  to  be  reflected 
in  cross-sectional  variation  in  consimiption  growth  rates.  The  reason  is  that 
asymptotically  all  consmners  choose  the  same  rate  of  consumption  growth. 
In  economies  with  long-rtm  growth,  this  is  so  because  the  elasticities  of  in- 
tertemporal substitution  of  all  consumers  converge  asymptotically  to  6~^. 
In  economies  without  long-run  growth,  this  is  so  because  the  rate  of  retmrn 
converges  asymptotically  to  the  rate  of  time  preference.  In  either  case,  all 
consumers  choose  the  same  consumption  growth  rate  in  the  limit,  although 
there  might  be  substantial  cross-sectional  variation  in  consvmiption  growth 
rates  during  the  transition. 


2.3     The  Cross-section  of  Wealth 

a 
a 


Next  define  af  =  —  and  let  a^{t)  =  (af  (f), ...,  a^{t))  be  the  vector  of  relative 

asset  holdings  or  the  cross-section  of  wealth.   Using  equations  (2)-(5),  it  is 
easy  to  show  that  the  jth.  element  of  a^  has  the  following  law  of  motion: 


af  =  {d2-7r  +  d3-f3)-  af  -  rfa  •  tt^  -  dg  •  Pj  (10) 

where  ^2  and  d^  are  defined  as  follows: 

^  =  !2i:ii!;        d3  =  ^  (11) 

o  a 

The  integral  form  of  equation  (10)  is: 


J{d2-ir+d3-0)-dT  I  f(d2-iT+d3P)dv 

af{t)  =  af  (0)  •  eo  -  j  (^2  •  t^j  +  4  •  Pj)  •  e-  ■  dr  (12) 

0 

Either  equation  (10)  or  its  integral  version  (12)  provide  a  complete  char- 
acterization of  the  cross-section  of  wealth  as  a  function  of  ^2  a^d  d^  and 
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the  distribution  of  individual  chaxacteristics,  af{0),  ttj   and  P.    Note  that 

the  functions  ^2  aJid  ^3  depend  only  on  average  variables  and  the  latter  are 

independent  of  their  distribution  by  virtue  of  the  RC  result. 

There  are  a  number  of  interesting  questions  regarding  the  cross-  section 

of  wealth  that  can  be  framed  in  terms  of  conditions  on  the  functions  ^2 

and  ^3-  For  instance,  one  might  want  to  know  whether  in  a  paiticulax  time 

period,  holding  constant  other  individual  characteristics,  poor  consumers 

tend  to  accimiulate  faster  than  rich  ones.    This  question  is  equivalent  to 
t 

asking  whether   /  (^2  •  tt  +  da  •  /?)  •  dr  is  negative.  The  smaller  this  integral 

0 

is,  the  faster  is  the  speed  of  (conditional)  convergence  among  consxmaers. 

One  might  also  want  to  know  what  the  model  predicts  about  the  cross- 
coiuitry  correlation  between  average  growth  and  changes  in  the  cross-section 
of  wealth.  Although  the  sign  of  this  correlation  crucially  depends  on  the 
specification  of  firms  and  technology  that  one  chooses,  there  is  an  interesting 
observation  that  can  be  made  at  this  stage.  Using  the  definitions  of  ^2  and 
da,  one  can  integrate  equation  (4)  to  find  that: 

t 

J(r—mi—d2-iT—d3-P)-dT 

a{t)  =  a(0)  •  e°  (13) 

Equations  (12)  and  (13)  show  that,  if  we  observe  a  collection  of  economies 
with  the  same  (path  for)  r  and  mi,^^  one  should  find  that  the  higher  the  av- 
erage growth  rate  in  a  given  period,  the  faster  the  speed  of  (conditional) 
convergence  among  consiuners.  If  we  are  wUUng  to  assinne  that  the  distribu- 
tion of  skill  and  taste  parameters  is  more  concentrated  than  the  distribution 
of  assets,  this  imphes  that,  ceteris  paribus,  high-growth  countries  would  ex- 
perience a  faster  improvement  (or  slower  worsening)  in  measures  of  wealth 
inequality  than  low-growth  coimtries.^^ 


^^For  instance,  a  set  of  open  economies  with  identical  values  for  p  £ind  6. 

^^This  is  true  in  our  model  without  uncertsiinty.  If  consumers  are  subject  to  shocks  the 
existence  of  conditional  convergence  (divergence)  does  not  necessarily  imply  that  measures 
of  inequality  or  dispersion  improve  (worsen).  For  a  fuller  discussion  of  this  point,  see  the 
distinction  between  /3— convergence  and  ct— convergence  in  Barro  and  Sala-i-Martin  (1995, 
chapters  1  and  11). 
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2.4     The  Cross-section  of  Income 

Since  available  cross-sectional  data  is  mainly  on  personal  income,  it  is  inter- 
esting to  determine  the  model's  implications  for  the  cross-section  of  incomes, 
y-  =  r-kj+w-TTj.  As  usual,  define  yf  =  ^  and  let  y^{t)  =  (yf  (<), ...,  yj{t))  G 

3?:[  be  the  vector  of  relative  incomes.  A  simple  algebraic  manipulation  shows 
that: 


yf  =  Q-af  +  (l-a)-^  (14) 


r  •  a 
where  a  is  the  share  of  capital  income  in  total  income,  i.e.  a  = .  There- 

y 

fore,  the  only  extra  piece  of  information  required  to  derive  theoretical  restric- 
tions for  the  cross-section  of  real  incomes  is  the  function  a. 
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3     Distribution  in  Two  Popular  Models 

Much  of  existing  growth  theory  takes  the  RC  model  and  characterizes  the 
behavior  of  average  quantities  under  alternative  assumptions  about  firms  and 
technology.  The  reinterpretation  of  the  RC  model  developed  above  allows  us 
to  expand  the  range  of  these  theoretical  exercises  to  include  a  characterization 
of  the  whole  distribution  of  the  quantities  of  interest.  More  precisely,  we 
can  use  the  following  algorithm  to  extract  testable  restrictions  regarding  the 
cross-sections  of  consvunption,  wealth  and  income  from  any  RC  model: 

1.  Solve  your  preferred  RC  model  and  characterize  the  equihbrium  time- 
path  of  average  quantities,  prices  and  the  functions  d2,  d^  and  a; 

2.  Make  assimaptions  about  the  distribution  of  individual  characteristics, 
o(0),  TT  and  ,5; 

3.  With  the  information  obtained  in  the  previous  steps  at  hand,  use 
the  appropiate  cross-sectional  equations  to  obtain  testable  restrictions 
regarding  the  evolution  of  any  initial  cross-section  of  consumptions, 
wealth  and/or  real  incomes. 

Of  course,  the  usefulness  of  this  algorithm  depends  on  whether  the  class 
of  models  defined  by  our  assumptions  includes  interesting  specifications.  By 
way  of  example,  this  section  shows  that  this  is  the  case. 

3.1     The  Ramsey  Model  and  the  Kuznets  Curve 

Assiime  that  there  is  a  large  nimaber  of  identical  competitive  firms  with  firee 
access  to  the  following  technology  /  :  3fJ^  — >  !R+  ,  that  maps  the  stock  of 
capital  that  a  worker  tases  and  its  skill  parameter  to  production  per  worker. 
Let  k  be  the  average  stock  of  capital  (physical  and  human)  in  the  economy. 
We  assume  that  /  is  homogeneous  of  degree  one  and  that  fk>Q,fkk<0  and 
hm  fk  =  0.  Since  all  firms  are  identical,  they  will  use  the  same  proportions 

k — ►oo 

of  capital  and  skills,  and  these  proportions  have  to  be  the  average  ones. 
Moreover,  since  firms  are  competitive,  factor  prices  are  given  by  the  marginal 
products  of  the  relevant  factor,  i.e.  r  =  fk{k,  n)  and  w  =  f{k,  7r)-k-  fk{k,  tt). 
Moreover,  in  autarky  the  average  (net)  stock  of  assets  must  equal  the  capital 
stock,  a  =  k. 
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Average  variables  in  this  economy  behave  exactly  as  in  the  well-known 
Ramsey-Cass-Koopmans  model.  A  complete  chaxacterization  of  the  distribu- 
tive dynamics  that  are  possible  in  this  model  is  outside  the  scope  of  this 
paper.  However,  we  have  been  able  to  establish  a  nvmiber  of  interesting 
properties  for  the  logarithmic  case,  i.e.  6  =  1.  This  is  a  very  convenient  pa- 
rameter restriction  because  it  impUes  that  d2-n+d3-3  =  -,  where  z  =  — ; — . 

z  k 

^^  Accordingly,  a  cross-section  of  consvimers  exhibits  conditional  convergence 
(divergence)  in  wealth  whenever  z  is  faUing  (rising). 

To  study  this  model,  it  is  therefore  useful  to  rewrite  the  standard  Ramsey 
dynamical  system  in  (2,  k)  space  as  follows: 

i  =  (,.,+E!l4±i-f,^k,.)].z  (15) 


k  =  f{k,n)+P-z-k  (16) 

As  is  well-known,  this  system  has  a  single  steady  state  that  is  saddle- 
path  stable.  The  steady-state  values  of  z  and  k  are  implicitly  defined  by: 
fk{k*,  tt)  =  /9  and  z*  •  fc*  =  f{k*,7r)  +  p.  If  the  economy's  initial  capital  is  k*, 
we  have  that  the  cross-section  of  wealth  does  not  vary  over  time.  ^'*  This  is 
also  true  for  the  cross-section  of  consvmiptions,  since  7  =  0. 

If  the  economy  starts  from  below  the  steady-state,  we  know  that  k  ap- 
proaches monotonically  its  steady-state  value.  However,  the  behavior  of  z 
depends  crucially  on  the  properties  of  /(., .).  For  instance,  consider  first  the 
popular  Cobb-Douglas  technology:  f{k)  =  A;°  •  7r^~".  Figure  1  shows  the 
phase  diagram  of  this  example.  As  the  economy  travels  along  the  stable 
arm,  z  declines  monotonically  towards  its  steady-state  value.  As  a  resiilt, 
the  Ramsey  model  with  logarithmic  utility  and  Cobb-Douglas  technology 
predicts  conditional  convergence  during  the  transition  (from  below)  towards 
the  steady-state. 

To  caution  the  reader  against  drawing  quick  conclusions  from  the  Cobb- 
Douglas  example,  consider  this  CES  technology:   f{k)  =  (fc''  +  tt*)*  with 


z          c         k       1 
^^To  see  this,  note  that  -  = -  -  =  -  •  ((m2  -  ti))  •  tt  +  7713  •  /3)  =  da  •  tt  -t-  ds  •  /3. 

Z  C  "T"  JJ  iC  K 

^^To  see  this,  note  that  z  =  0  and,  as  a  result,  da  •  tt  +  ^3  •  /3  =  0.  Since  the  latter  is  true 
for  any  value  of  tt  and  /3,  it  follows  that  da  =  d3  =  0.  Therefore,  equation  (12)  implies 
that  kf{t)  =  fcf  (0). 
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b  <  0.  The  elasticity  of  substitution  between  capital  and  labor  is  less  than 
one,  i.e.  (1  —  6)"^  <  1  .  Figtnre  2  shows  the  phase  diagram  for  this  economy, 
assuming  that  p  is  high  enough  (so  that  the  steady-  state  is  located  in  the 
upward-sloping  region  of  the  z  =  0  Une)  and  0  =  0.  In  this  economy,  z 
is  always  increasing  and,  as  a  result  there  is  always  conditional  divergence. 
Figure  3  shows  the  case  in  which  p  is  low.  If  the  economy  starts  at  a  low 
enough  level  of  capital,  z  is  initially  increasing  and  then  declines.  At  low 
levels  of  development,  the  economy  exhibits  conditional  divergence,  while  at 
high  levels  of  development  it  is  characterized  by  conditional  convergence. 

This  last  result  is  reminiscent  of  one  of  the  most  famous  ideas  in  economic 
development:  the  existence  of  an  inverted-U  shaped  relationship  between 
measures  of  income  inequeihty  and  the  level  of  income  of  a  coimtry.  This 
curve  is  named  the  Kuznets  curve,  since  it  was  Kuznets  (1955)  who  first 
hypothesized  its  existence  and  gave  some  evidence  in  its  support.  Since  then 
a  large  number  of  empirical  studies  have  piled  up  either  to  further  document 
or  to  deny  the  existence  of  the  Kuznets  curve.  At  the  same  time,  a  substantial 
body  of  research  has  tried  to  generate  Kuznets  curves  in  theoretical  models. 
Without  taking  a  stand  on  whether  the  Kuznets  curve  is  a  fact  or  just  an 
hypothesis,  we  simply  note  that  om:  CES  example  imphes  that  the  Hamsey 
model  is  consistent  with  the  Kuznets  curve  hypothesis.  To  show  this,  we 
use  the  coefficient  of  variation  of  kj  as  our  measure  of  inequality:   CV  = 


/ 


Var  {k^\  ;  and  assume,  for  simpUcity,  that  there  is  no  cross-sectional 

variation  in  tastes  and  skills,  i.e.  /3j  =  0  and  ttj  =  tt  for  all  ji.  Simple  algebra 

shows  that: 

t 

CV{t)  =  CV{0)  •  eo  (17) 

That  is,  inequality  increases  (decreases)  in  periods  where  z  is  increasing 
(decreasing),  i.e.  d2  •  it  +  ds  •  0  >  0  {d2  •  n  +  ds  •  P  <  0).  It  follows  that, 
in  a  Ramsey  economy  with  CES  technology  as  depicted  in  figure  2,  at  low 
levels  of  development  inequality  is  rising,  while  at  sufficiently  high  levels  of 
development  inequality  is  falling.  This  is  nothing  but  the  Kuznets  curve. ^^ 
Some  intuition  for  this  result  can  be  obtained  by  noting  that,  if  /?  =  0: 

Note,  however  that  in  the  Ramsey  economy  there  is  no  Kuznets  curve  for  consumption. 
The  cross-section  of  consimiption  rates  will  always  exhibit  convergence  (if  ^3  >  0)  or 
divergence  (if  /?  <  0). 
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d2-n  +  d3-0  =  !-^ 

K 

Ceteris  paribus,  if  the  growth  rate  of  wages  is  low,  0  is  negative  and 
consumers  that  have  a  low  stock  of  capital  relative  to  their  labor  productivity 
parameter,  accumulate  capital  at  a  higher  rate.  The  intuition  for  this  result 
follows  from  the  assumption  that  the  economy  is  populated  by  permanent- 
income  consimiers.  These  constuners  calculate  the  net  present  value  of  their 
income  and  choose  their  optimal  consumption  path.  Since  all  consvuners 
have  the  same  spending  shares  (a  property  of  homothetic  preferences),  they 
all  spend  the  same  fraction  of  their  wealth  in  each  date  and  therefore  exhibit 
identical  rates  of  wealth  accmnulation.  But  the  growth  rate  of  wealth  is  a 
weighted  average  of  the  growth  rates  of  its  two  components,  the  stock  of 
capital  and  the  net  present  value  of  wages.  The  growth  rate  of  the  latter 
is  the  same  for  all  consumers.  If  this  groAvth  is  low,  consimiers  must  be 
accumulating  capital  at  a  rate  that  exceeds  that  of  total  wealth.  But  how 
much?  It  depends  on  how  large  is  the  share  of  capital  in  a  consumer's  total 
wealth.  The  lower  the  share  of  capiteil  is,  the  higher  is  the  rate  of  capital 
accimiulation  that  is  reqmred  to  sustain  the  optimal  consvmiption  path.  This 
is  why  consimiers  that  have  a  low  stock  of  capital  relative  to  their  labor 
productivity  parameter  tend  to  accumulate  capital  faster  when  the  growth 
rate  of  wages  is  low.  A  symmetric  argument  works  for  the  case  in  which 
the  net  present  value  of  wages  grows  at  a  high  rate.  In  our  Kuznets-curve 
example,  wage  growth  accelerates  as  the  economy  accumulates  capital.  If  p 
is  low  enough,  this  leads  to  a  reversal  in  the  sign  oi  d2  ■  tt  +  d^  •  0}^ 

As  the  examples  here  show,  the  distributive  djoiamics  of  the  Ramsey 
model  depend  crucially  on  the  properties  of  the  technology  that  one  as- 
sumes. Figures  1  to  3  show  examples  in  which  there  is  conditional  con- 
vergence always,  never  and  sometimes.  More  general  dynamics  are  also 
possible.  In  particular,  it  is  not  difficult  to  generate  conditional  conver- 
gence and  divergence  cycles  by  appropriately  choosing  the  production  func- 
tion. A  necessary  (but  not  sufficient)  condition  for  having  p  alternating 
periods  of  conditional  convergence  and  divergence  is  that  the  first  derivative 

of  g{k)  =  fk{k,  tt) — ,  change  signs  p  —  1  times  as  k  increases. 


^^See  Ventura  (1996)  for  a  similar  argument  in  the  context  of  international  growth 
comparisons. 
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3.2     Distribution  in  a  Linear  Growth  Model 

Assume  that  there  is  a  large  number  of  competitive  firms  with  free  access 
to  the  following  Unear  technology:  f{k)  =  A-k  +  TTj;  where  k  is  the  av- 
erage stock  of  capital  (physical  and  himian)  and  the  parameter  A  satisfies 
the  following  restrictions  A  >  p  >  A  ■  {I  —  6).  The  first  inequality  gen- 
erates positive  growth  rates,  while  the  second  inequality  ensures  that  util- 
ity is  boimded.  To  rule  out  negative  consmnption,  we  assimie  that  fc(0)  > 

P  +  TT  A-p  P  TtI 

~A~'  p+{e-l)-A'A~AJ  ' 
Since  firms  are  competitive,  factor  prices  are  given  by  the  marginal  prod- 
ucts of  the  relevant  factor,  i.e.    r  =  A  and  w  =  1.    Also,  we  have  that 
the  average  stock  of  assets  must  equal  the  capital  stock,  a  =  k.  It  follows 
immediately  from  equation  (5)  that  average  consimiption  is  given  by: 

^p+{e-l)-A    fj^  ^   n\      A-p    P 


max 


(-i)-^-f  <-) 


e 

Substituting  this  consumption  function  in  (4)  and  integrating,  we  find 
that: 

Mt)=(MO)  +  ^)-.^'-^  (19) 

U  P  +  IT  >  0  {P  +  TT  <  0)  both  average  consumption  and  the  capital 
stock  grow  at  a  rate  that  is  positive,  non-increasing  (non-decreasing)  and 
asymptotically  approaches  6~^  •  {A  —  p). 

Next  we  want  to  determine  the  cross-section  of  wealth.  ^^  Using  equation 
(12)  we  find  that: 

'  A-k{0)  +  {P  +  n)-[l-e-^') 


^^Remember  that  the  information  required  for  this  purpose  is  summarized  by  two  func- 
tions: d2  and  ds  .  It  follows  from  (18)  and  (19)  that: 

d2=  d3  =  ^^  ■  \{A-  k{0)  +  P  +  tt)  ■  e"^-'  -  {0  +  Tr)'~^ 
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This  equation  shows  that  the  relative-wealth  position  of  a  consumer  in- 
creases if  and  only  ii  Pj+ttj  >  {0  +  7r)-k^{0)  (since,  by  assumption,  A-k{0)  > 
—  {0  +  7r)).  It  follows  from  this  equation  that  the  Unear  economy  exhibits 
conditional  convergence  (divergence)  if  and  only  if  /3  +  tt  >  0  (/?  +  7r<0). 
Interestingly,  in  the  hnear  model  there  cannot  be  alternating  periods  of  con- 
ditional convergence  and  divergence,  as  in  the  Ramsey  model. 

To  understand  the  intuition  behind  equation  (20)  it  is  useful  to  distinguish 
between  the  effects  of  Pj  and  ttj  in  determining  changes  in  the  distribution 
of  wealth.  Consider  first  the  taste  parameter.  As  mentioned  in  the  previ- 
ous section,  if  a  consmner  satisfies  the  following  condition:  /3j  >  /3  •  k^{0), 
her  elasticity  of  intertemporal  substitution  exceeds  that  of  the  average  con- 
sumer. Ceteris  paribus,  this  consumer  saves  more  than  average  and  her 
relative  wealth  position  improves.  Not  surprisingly,  we  find  that,  in  a  grow- 
ing economy,  consumers  that  axe  flexible  in  the  timing  of  their  consumption 
will  be  moving  upwards  in  the  wealth  distribution.  Consider  next  the  skills 
parameter.  If  a  consimier  satisfies  the  following  condition:  ttj  >  n  ■  k^{0), 
she  has  a  larger  share  of  wage  income  in  wealth.  In  a  growing  economy,  the 
marginal  propensity  to  consume  out  of  wage  income  is  less  than  one.^^  Ce- 
teris paribus,  the  larger  is  the  share  of  wage  income  in  wealth,  the  larger  is 
the  addition  to  the  stock  of  assets  in  every  period.  As  a  result,  we  find  that 
consimaers  with  high  share  of  wage  income  in  wealth  save  more  and  move 
upward  in  the  wealth  distribution. 

Finally,  one  can  also  use  equation  (20)  to  determine  the  long-run  or 
steady-state  distribution  of  wealth: 

.■to  ^f w  =  ^fc)  ■  ^ .  4Ti + .  ^  '^^ + -''  ■  ^ .  m\  0 1 .  '"> 

As  this  expression  shows,  the  long-nm  distribution  is  a  weighted  aver- 
age of  the  initial  wealth  distribution  and  the  distributions  of  taste  and  skill 
pareimeters.  If  /?  -|-  tt  >  0,  the  limiting  distribution  of  wealth  moves  away 
from  the  initial  distribution  of  wealth  and  towards  the  distribution  of  skills 
and  tastes.  If  /S  +  n  <0,  the  limiting  distribution  of  wealth  accentuates  the 
initial  distribution  of  wealth.  This  is  nothing  but  another  way  of  looking  at 
the  convergence  properties  of  the  model,  already  discussed  above. 


^*In  this  linear  economy,  this  marginal  propensity  is  1  —  6~^  ■  (l  —  -^j. 
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4     Economic  Growth  in  a  Cross-section  of  Con- 
sumers 

In  this  section  we  estimate  a  number  of  "growth  regressions"  for  a  represen- 
tative saxaple  of  American  famihes.  We  find  evidence  of  conditional  conver- 
gence throughout  the  1970's  and  divergence  in  the  1980's.  Since,  unlike  the 
Ramsey  model,  the  hnear  model  does  not  admit  reversals  in  the  pattern  of 
convergence,  this  can  be  seen  as  an  empirical  failure  of  the  simplest  of  growth 
models. 


4.1     Objective  and  Motivation 

In  the  last  ten  years  a  substantial  hteratiore  has  studied  the  dynamic  prop>- 
erties  of  per  capita  income  in  cross-sections  of  countries  and  regions.  A 
conunon  finding  is  that,  once  a  set  of  control  variables  is  held  constant,  there 
is  a  negative  correlation  between  initial  GDP  and  subsequent  growth.  This 
finding  is  usually  referred  to  as  conditional  convergence,  because  it  impUes 
that,  conditional  on  the  controls,  economies  that  have  relatively  high  levels  of 
income  at  the  begiiming  of  the  period  will  grow  at  a  slower  rate  than  initially 
poor  coimtries. 

The  theoretical  results  of  this  paper  lead  us  naturally  to  investigate  an 
analogous  empirical  specification  for  individual  incomes.  Using  equations 
(12)  and  (14),  some  algebra  allows  us  to  write: 

Ayf  (t  +  h)  =  (Ai(f ,  h)-l)-  yf{t)  +  X^it,  h)  ■  tt,-  -f  M{t,  h)  ■  (3j         (22) 

If  Ai(i,  h)  <  1,  there  is  a  negative  (partial)  correlation  between  initial  relative 
income  and  the  subsequent  change.  ^^  This  is  akin  to  conditional  convergence: 
it  imphes  that,  conditional  on  ttj  and  0j,  lower  income  individuals  have  gained 
groimd  relative  to  higher  income  ones.  On  the  other  hand,  if  Xi{t,h)  >  1 
those  who  started  the  period  with  high  relative  incomes  find  their  relative 

^®The  coefficients  in  equation  (21)  are  given  by: 

t+h 


IT 
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position  further  improved,  an  outcome  we  will  describe  as  conditional  diver- 
gence. 

The  examples  we  develop  in  Section  3  indicate  that  different  models  have 
different  implications  for  the  expected  magnitude  of  Ai.  In  the  linear  tech- 
nology model  we  have  shown  that  dAk^{t)/dk^{0)  <  0  (and  hence  \i  <  1) 
if  and  only  ii  /3  +  -rr  >  0.  Therefore,  both  convergence  and  divergence  are 
possibiUties,  according  to  whether  the  equalizing  effect  of  labor  income  is 
stronger  or  weaker  than  the  effect  of  intertemporal  substitution  (that  goes  in 
the  opposite  direction  if  /?  <  0).  However,  there  is  no  possibility  of  sign  re- 
versal {P+TT  being  a  constant):  in  the  Unear  technology  model,  if  there  is  one 
period  characterized  by  conditional  convergence  all  other  periods  must  dis- 
play convergence  too.  On  the  other  hand,  our  Kuznets  curve  example  shows 
that,  in  the  Ramsey  model,  it  is  possible  to  have  reversals  in  the  direction  of 
distributive  dynamics. 

A  closely  related  interpretation  of  equation  (22)  is  as  a  decomposition  of 
relative  income  in  its  determinants:  initial  income,  skills  and  tastes.  Rewrite 
(22)  in  levels  (assimiuig  /3  7^  0): 

yf{t  +  h)^  Ai(t,  h) .  yf{t)  +  TT .  A2(t,  /i)  •  Trf  -F  /3  •  A3(t,  h)  ■  Pf         (23) 

where  Trf  =  ^  and  /?f  =  §.  Clearly,  Ai(t,  0)  =  1,  and  A2(t,  0)  =  Xz{t,  0)  = 
•'it  ■'         p 

0.  Now  suppose  that,  for  h  >  0,  Ai(t,  h)  <  I.  This  implies  that  tt  •  X2{t,  h)  + 
0  •  X3{t,h)  >  1  (see  footnote  18).  Hence,  conditional  convergence  implies 
that  the  contribution  of  initial  income  has  fallen  over  the  period,  while  either 
or  both  of  the  weights  of  skills  and  tastes  have  risen.  In  other  words,  the 
distribution  of  income  is  moving  away  from  the  distribution  of  initial  income, 
and  in  the  direction  of  a  combination  of  the  distributions  of  tastes  and  skills. 
An  opposite  conclusion  apphes  if  Xi{t,h)  >  1. 


t+h 

*■*"''  /  (d2-if+d3/3)<i« 

d2  •  e  -^  -dr 

t 


-a{t  +  h)-    f 
t 

t+l 

/J    ya2'-n-ra3-p/-av 
dz-e-  -dr 

t 

These  coefficients  satisfy  the  restriction  Xi(t, h)  +  A2(f , /i)  •  tt  +  A3(t,  ft)  •  /?  =  1. 
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Ovir  objective  is  to  provide  estimates  of  \i{t,h)  for  a  cross-section  of 
American  families  for  the  period  1970-1990  (and  its  subperiods).  As  is  well 
known,  this  is  an  epoch  of  increased  dispersion  in  individual  incomes.  The 
foregoing  discussion  impUes  that  whether  we  find  evidence  of  conditional 
convergence  or  divergence  has  far  reaching  impUcations  for  the  nature  of  the 
recent  increase  in  inequality. ^°  If  there  is  convergence,  increased  dispersion 
impUes  that  market  forces  have  increasingly  rewarded  skills  and  flexibiUty  in 
consiunption,  and  that  these  characteristics  are  widely  dispersed  among  the 
population.  If  there  is  divergence,  inequality  has  increased  merely  because 
the  rich  has  gotten  richer,  regardless  of  the  distribution  of  skills  and  tastes. "^^ 

4.2     Data  and  Method 

We  work  with  data  from  the  Panel  Study  of  Income  Djmamics  (PSID).  The 
PSID  contains  comprehensive  information  on  the  incomes  of  a  large  number 
of  families  and  their  members,  and  the  panel  -  initiated  in  1968  -  covers  a 
relatively  long  time  span.  We  extract  annual  cross-sections  of  total  family 
money  income  between  1970  and  1990.  This  variable  includes  income  of  all 
soiirces  (labor,  interest,  transfers,...)  earned  by  all  the  members  of  the  family. 
Equation  (22)  require  us  to  work  with  relative  incomes.  We  thus  divide 
size-adjusted  family  income  by  the  population  average."^^  Table  1  reports 
some  descriptive  statistics  of  these  cross-sections  of  family  incomes  for  three 
selected  years:  1970,  1980  and  1990.  In  every  interview  year  a  "head"  is 
identified  for  each  family.  In  order  to  compare  reasonably  homogeneous  age 
groups,  in  Table  1  we  focus,  for  each  year,  on  famihes  whose  head  is  between 
20  and  50  years  of  age. 

The  story  told  by  Table  1  is  consistent  with  a  wealth  of  recent  empirical 


^"Increasing  income  dispersion  is  not  inconsistent  with  conditional  convergence:  if  the 
skill  and  taste  coefficients  are  more  widely  dispersed  than  initial  income,  conditional  con- 
vergence can  take  place  at  the  same  time  as  absolute  divergence. 

■^^Note  that  equation  (23)  imposes  two  strong  restrictions  on  the  dynamics  of  individual 
(relative)  incomes.  First,  for  a  given  individual,  ciu-rent  income  depends  linearly  on  initial 
income.  We  refer  to  this  as  the  linearity  property.  Second,  the  coefficient  in  this  linear 
relationship  is  the  same  across  individuals.  That  is,  the  quantity  X\{t,h)  is  a  function 
of  aggregate  variables  only,  and  does  not  depend  on  any  individual  characteristic  of  the 
consimier  (i.e.  ttj  and  Pj).  We  call  this  an  invariance  property.  In  the  appendix  we  present 
a  joint  test  of  these  linearity  and  invariance  properties. 

More  details  on  how  we  put  together  our  data  are  given  in  Appendix  1. 
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Table  1:  Family  Income  Distribution  in  1970,  1980  and  1990 


Year 

Std.  Dev. 

10th  Perc. 

25th  Perc. 

75th  Perc. 

90th  Perc. 

Obs 

Avg.  age 

1970 
1980 
1990 

0.639 
0.863 
0.904 

0.355 
0.279 
0.244 

0.564 
0.499 
0.469 

1.281 
1.283 
1.299 

1.705 
1.827 
1.846 

3160 
4510 
5691 

35.173 
33.451 
36.642 

Note:  Summary  statistics  of  the  cross-sections  of  1970,  1980  and  1990  family  income 
relative  to  the  population  average.  For  each  year,  only  famihes  whose  head  is  between  20 
eind  50  yezirs  of  age  are  included. 

studies  documenting  an  increase  in  income  inequality  throughout  the  70s 
and  the  80s.  The  coefficient  of  variation  of  the  family  income  distribution  has 
increased  rather  dramatically  over  the  sample  period  (recall  that  this  measure 
of  inequality  coincides  with  the  standard  error  of  our  income  variable,  and  is 
thus  reported  in  Table  1).  In  addition,  both  the  90th-10th  and  the  75th-25fh 
percentile  ranges  have  increased  in  both  sub-decades.  Hence,  in  the  lexicon 
of  the  recent  empirical  growth  literature.  Table  1  confirms  that  the  last  20 
years  are  characterized  by  cr-divergence  of  the  family  income  distribution.  It 
remains  to  be  seen  whether  there  is  /5-divergence  or  convergence. 

We  treat  Xi{t,  h)  as  an  exogenoios  parameter  to  be  estimated  with  cross- 
sectional  techniques.  Specifically,  for  various  t  and  h,  we  nm  cross-family 
regressions  of  the  form: 


yf{t  +  h)  =  j'Xj  +  8  ■  yf{t)  +  €jit  +  h) 


(24) 


where  Xj  is  a  vector  containing  a  constant  and  a  number  of  individual  char- 
acteristics of  the  head  of  family  j  (age,  age  squared,  education,  sex  and  race) 
and  €j{t  +  h)  is  a  zero-mean  stochastic  error  term.  Hence,  we  are  treating 
'/Xj  as  the  linear  predictor  of  the  quantity  A2(t,  h)-'Kj+X3{t,  h)  -Pj  ,  given  Xj. 
This  method  allows  us  to  interpret  6  as  an  estimate  of  Xi{t,h)  in  equation 
(23). 

Estimation  of  (24)  reqioires  an  instrumental  variable  for  yf'it).  First, 
the  error  term  contains  the  deviation  of  A2(t,  h)  •  ttj  +  Xz{t,h)  •  f3j  from  its 
Unear  predictor.  It  is  imhkely  that  this  deviation  wiU  be  incorrelated  with 
yf{t).  Hence,  there  is  a  potential  endogeneity  (or  omitted  variable)  problem. 
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Second,  measurement  error  is  a  pervasive  problem  in  the  PSID.  Measm-ement 
error  would  induce  downward  bias  in  an  ordinary  least  squares  estimate  of 
(24),  while  the  correlated,  omitted  individual  effect  induces  upward  bias  if 
Xi{t,  h)  <  1,  and  downward  bias  if  Ai(t,  h)  >  1. 

We  use  as  instruments  aggregate  measures  of  income  in  the  state  of  res- 
idence of  individual  j.  More  precisely,  we  use  aggregate  data  from  the  Sta- 
tistical Abstract  of  the  United  States  to  construct  the  instriunental  variables 

where  Y-^^{t)  {Yj^^{t))  is  per  capita  income  (median  family  income)  in 
the  state  where  family  j  was  resident  in  year  f,  and  Yys^{t)  {Y(js^{t))  is 
the  average  (median)  U.S.  level.  The  basic  idea  is  that  the  skill  and  taste 
characteristics  of  a  specific  family  axe  imlikely  to  have  an  impact  on  state- 
wide summary  statistics,  such  as  per  capita  income  or  median  family  income. 
Hence,  our  instruments  should  be  imcorrelated  with  the  individual  effect  in 
the  error  term.^^  On  the  other  hand,  the  income  of  a  family  is  likely  to 
be  affected  by  the  general  level  of  activity  in  the  area  where  the  family  is 
economically  active.  Therefore,  oui  instrvmients  will  have  some  degree  of 
correlation  with  the  variable  to  be  instrumented.^'* 

Although  our  empirical  exercise  is  close  in  spirit  to  the  recent  cross- 
country empirical  growth  Uterature,  our  estimation  strategy  is  quite  different. 
Because  the  error  term  (potentially)  contains  an  unobservable  idiosjoicratic 
effect  (innate  skills,  tastes)  we  cannot  estimate  a  cross-sectional  regression 
using  ordinary-least-squares.  This  problem  is  also  present  in  cross-coimtry 
growth  regressions  if  there  are  imobserved  differences  in,  say,  the  aggregate 

^^One  possible  problem  is  that  there  might  be  a  "state  eflFect"  in  the  labor  productivity 
parameter  ttj.  We  would  argue,  however,  that  the  state  effect  enters  mainly  through  sys- 
tematic differences  in  average  educational  achievement.  Because  we  control  for  education 
the  part  of  ttj  that  is  left  in  the  error  term  should  be  largely  purged  from  state  effects. 
This  intuition  is  confirmed  by  statistical  testing:  we  perform  Hansen's  tests  for  the  vahdity 
of  the  overidentifying  restrictions  for  all  the  regressions  we  present,  amd  we  always  tail  to 
reject. 

^'*In  principle,  this  correlation  may  be  tenuous.  Hence,  for  each  instnunent  and  sub- 
sample  we  have  checked  the  robustness  of  the  instrument  by  computing  the  F  statistic  for  a 
"first  stage"  ordinary  least  sqares  regression  of  ovu-  explanatory  variable  on  a  constant  and 
the  instrumental  variables  themselves.  MD  and  MN  are  the  two  candidate  instrimients, 
of  an  initially  larger  set,  for  which  such  F  tests  are  systematically  and  comfortably  high. 
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production  function.  Some  recent  empirical  growth  papers  have  tried  to 
solve  the  problem  using  panel  data  techniques,  such  as  differencing  away  the 
coimtry-specific  effect.  In  our  case,  however,  the  individual  effects  miolti- 
ply  time- varying  coefficients  {X2{t,h)  and  X3{t,h)  in  equation  23),  so  that 
differencing  would  not  eliminate  the  problem.  ^^ 

4.3     Results 

We  start  by  studying  conditional  convergence  in  the  two  ten-year  sub-periods: 
the  1970's  and  the  1980's.  Hence,  we  first  estimate  equation  (24)  with  t  = 
1970  and  h  =  10,  and  then  with  t  =  1980  and  h  =  10.  Table  2  presents  a 
variety  of  results.  Each  row  corresponds  to  a  different  combination  of  sample 
selection  criteria,  normalization  of  the  data,  Ust  of  controls  included  from 
vector  Xj,  and  age  group  of  the  household  head  at  time  t.  Hence,  the  first 
row  uses  all  families  whose  head  was  between  20  and  50  years  of  age  at  time 
t,  and  the  whole  set  of  controls.  In  the  second  row  of  Table  2  we  drop  all 
controls  (except  for  age:  we  want  to  be  sure  our  results  are  not  driven  by 
life-cycle  effects).  Rows  (3)  and  (4)  drop  the  very  rich  and  the  very  poor, 
and  rows  (5)  and  (6)  drop  fzimihes  whose  head  is  out  of  the  labor  force  in 
either  of  the  initial  and  the  final  periods.  Rows  (7)  and  (8)  add  the  diange  in 
family  size  as  a  control,  and  rows  (9)  and  (10)  normalize  total  family  money 
income  by  family  size.  Finally,  rows  (11)-(13)  further  explore  the  robustness 
of  the  results  to  compositional  effects,  by  estimating  the  growth  regressions 
for  relatively  homogeneous  age  groups. 

The  point  estimates  appear  quite  stable  across  specifications,  and  several 
of  them  are  statistically  different  from  1.  All  the  estimates  of  Ai(70, 10)  are 
less  than  1;  all  but  one  (and  all  of  the  significant)  estimates  of  Ai(80, 10) 
are  greater  than  1.  Further  variations  of  the  basic  specification  do  not  lead 
to  significant  changes  in  the  results  of  Table  2.  Hence,  we  tend  to  regard 
the  qualitative  result  that  the  convergence  coefficient  is  less  than  one  in  the 


^^Besides  being  analogous  to  cross-country  and  cross-regional  growth  regressions,  Equa- 
tion (23)  is  also  similar  in  spirit  to  the  inter-generational  income  mobihty  regressions  es- 
timated, among  many  others,  by  Solon  (1992),  Zimmermann  (1992)  and  Mulligfin  (1993). 
The  main  diflFerence  is,  of  course,  that  they  regress  childrens'  income  on  parents'  income, 
while  we  look  at  mean  reversion  (conditional  on  ttj  and  /3j)  within  the  life  span  of  an 
individual. 
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Table  2:  Distributional  Dynamics  in  the  1970s  and  1980s 


1 

2 

3 

4 

5 

6 

7 

Ai(70,10) 

Ai(80,10) 

sample 

controls 

init.  age 

obs7o 

obs8o 

1 

0.699 

1.300 

a 

a,11 

20-50 

2205 

2994 

2 

(0.237) 
0.791 

(0.203) 
1.323* 

a 

age 

20-50 

2229 

3005 

3 

(0.163) 
0.779* 

(0.130) 
1.315 

b 

all 

20-50 

2101 

2800 

4 

(0.105) 
0.819* 

(0.195) 
1.345* 

b 

age 

20-50 

2124 

2809 

5 

(0.078) 
0.831 

(0.129) 
1.386^° 

c 

all 

20-50 

1629 

1691 

6 

(0.104) 
0.875''° 

(0.212) 
1.321* 

c 

age 

20-50 

1645 

1698 

7 

(0.071) 
0.689 

(0.140) 
1.273 

a 

all,  Asize 

20-50 

2205 

2994 

8 

(0.239) 
0.785 

(0.197) 
1.277* 

a 

age,  Asize 

20-50 

2229 

3005 

9 

(0.165) 
0.686* 

(0.126) 
0.930 

d 

all 

20-50 

2079 

2872 

10 

(0.159) 
0.782 

(0.175) 
1.024 

d 

age 

20-50 

2103 

2882 

11 

(0.120) 
0.731 

(0.106) 
1.375 

a 

all 

20-30 

824 

1395 

12 

(0.224) 
0.600* 

(0.341) 
1.289 

a 

all 

30-40 

741 

1140 

13 

(0.171) 

0.362 

(0.514) 

(0.338) 

1.053 

(0.313) 

a 

all 

40-50 

796 

674 

Note.  Ck>lumn  1  (2):  estimate  of  6  in  equation  23,  with  t  =  1970  (1980)  and  h  =  10. 
Standard  errors  in  parenthesis.  The  symbol  *  (%)  indicates  that  the  95%  (90%)  confidence 
interval  does  not  include  1.  Column  3:  a:  all  famihes;  b:  drops  famihes  with  less  than 
1/10  and  more  than  10  times  average  income;  c:  JFurther  drops  ftmiilies  whose  head  is 
not  employed  in  either  t  oi  t  +  h;  d:  all  famihes,  but  income  is  divided  by  family  size. 
Column  4:  all:  age,  age^,  sex,  race,  education;  Asize:  change  in  family  size  over  estimation 
period.  Column  5:  age  group  of  family  heaxEit  t.  Column  6  (7):  sample  size  for  column  1 
(2)  regression.  Estimation  method:  optimally  weighted,  heteroscedasticity  consistent  two 
stage  least  squares. 


1970's,  and  greater  than  one  in  the  1980's  as  reasonably  robust. ^^ 

The  results  presented  in  Table  2  indicate  that  there  has  been  a  reversal 
in  distributive  dynamics  during  the  1970-1990  period.  The  1970s  emerge  as 
a  period  of  conditional  convergence:  after  controlling  for  heterogeneity  in 
skills  and  tastes,  we  find  that  individuals  who  started  the  decade  with  rel- 
atively high  incomes  have  subsequently  lost  part  of  their  adveintage.  In  the 
1980s  instead,  individual  incomes  have  conditionally  diverged.  The  appear- 
ance of  cross-individual  conditional  divergence  in  the  1980's  stands  in  sharp 
contrast  with  the  cross-regional  evidence,  that  tends  to  be  characterized  by 
convergence.^^ 

It  is  also  useful  to  read  the  results  in  terms  of  a  decomposition  of  relative 
incomes.  In  terms  of  equation  (23),  the  fact  that  Ai  has  fallen  below  1  in  the 
70's  corresponds  to  an  increased  weight  of  skills  and  tastes  in  determining 
the  relative  position  of  an  individual  in  the  distribution  of  income.  In  other 
words,  in  the  70's  individual  success  was  mainly  driven  by  one's  skills  and 
flexibihty.  The  rise  of  Ai  above  1  in  the  1980's  instead,  represents  a  fall  in 
the  contribution  of  skills  and  tastes,  and  an  increase  in  the  weight  of  initial 
wealth.  The  1980's  are  therefore  a  period  that  rewards  initial  wealth  and,  as 
a  result,  they  axe  characterized  by  a  sharp  increase  in  inequality. 

Until  now  we  have  estimated  (24)  keeping  h  fixed  and  varying  t.  We  can 
further  explore  the  reversal  in  distributive  dynamics  by  keeping  t  constant 
and  letting  h  vary.  Figure  4  plots  the  point  estimates  (and  confidence  bands) 
of  A(70,/i)  against  h,  for  h  =  1,2,  ...,20.  In  order  to  have  an  homogeneous 
sample,  we  include  only  famihes  whose  head  has  been  the  same  individual  in 
all  the  20  years.  Apart  firom  that,  the  specification  is  the  same  as  in  Row  1 
of  Table  2.  The  point  estimates  in  Figure  4  show  a  fairly  smooth  time  path, 
and  confirm  the  basic  insight  from  Table  2:  conditional  convergence  through- 
out the  1970's,  a  tiimaroimd,  and  divergence  thereafter.  Unfortimately,  the 
estimates  are  too  imprecise  to  identify  with  any  confidence  the  exact  date  of 
the  turnaroimd. 


^®  Ordinary  least  square  estimates  of  the  basic  specification  in  row  1  of  Table  2  lead 
to  point  estimates  of  0.634  for  Ai(70, 10)  and  0.513  for  Ai(80, 10).  Both  are  smaller  (the 
second  drtimatically  so)  than  those  reported  above:  as  predicted,  ordinary  least  squares 
suffer  from  severe  measurement  error  and/or  endogeneity  bias. 

^^See  the  study  of  convergence  across  the  U.S.  states  by  Barro  ans  Sala-i-Martin  (1995). 
It  is  interesting  to  note,  however,  that  they  report  much  lower  estimates  of  the  speed  of 
convergence  for  the  1980s  than  they  do  for  the  previous  decades. 
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No  matter  how  we  look  at  it,  however,  it  seems  clear  to  us  that  there 
has  been  a  reversal  in  the  patterns  of  convergence  around  the  middle  of 
our  sample  period.  This  evidence  is  inconsistent  with  the  linear  growth 
model  which,  as  we  have  shown  above,  does  not  admit  reversals.  Without 
any  further  restriction  on  technology,  the  Ramsey  model  is  consistent  with 
this  evidence.  One  should  however  note  that  simple  formulations  such  as 
logarithmic  utihty  and  Cobb-Douglas  technology  are  not  capable  to  generate 
the  pattern  in  the  data.  Therefore,  we  can  also  use  this  evidence  to  narrow 
the  set  of  empirically  admissible  technologies  that  can  be  associated  with  the 
Ramsey  model. 
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5     Concluding  Remarks 

This  paper  shows  that  the  RC  model  gives  us  ample  room  to  study  the 
forces  that  determiae  the  evolution  of  the  interpersonal  distributions  of  con- 
sumption, wealth  and  income.  We  analyze  a  class  of  growth  models  with 
three  sources  of  heterogeneity:  initial  wealth,  iimate  skills,  and  taste  for  con- 
sumption smoothing.  Despite  this  heterogeneity,  these  economies  admit  a 
representative  consumer  and,  as  a  result,  exhibit  aggregate  dynamics  that 
are  indistinguishable  from  those  of  the  homogenous-consimier  models.  Yet 
the  existence  of  heterogeneity  allows  us  to  derive  equations  describing  the 
dynamics  of  the  cross-sections  of  consumption,  wealth  and  income.  We  find 
that  the  class  of  models  analyzed  here  is  capable  of  generating  a  rich  variety 
of  distributive  dyngimics,  while  retaining  the  analj^ical  tractabihty  of  the 
homogeneous-consimier  models. 

We  think  that  a  study  of  the  distributive  impUcations  of  alternative 
growth  models  might  have  a  substantial  payoflF,  since  a  model's  abiUty  to 
generate  distributive  outcomes  that  are  consistent  with  the  data  constitutes 
a  further  criterion  against  which  to  evaluate  its  empirical  relevance.  In  this 
spirit,  we  study  the  distributive  impUcations  of  two  popular  growth  mod- 
els: the  Ramsey  and  hnear  growth  models.  We  find  that  both  of  them  are 
consistent  with  conditional  convergence  and  divergence  in  a  cross-section  of 
consumers.  However,  while  the  Ramsey  model  admits  the  possibility  of  alter- 
nating periods  of  divergence  and  convergence,  the  hnear  growth  model  does 
not.  Using  data  from  the  PSID,  we  imcover  that  there  has  been  a  reversal 
in  the  pattern  of  convergence  of  family  incomes  at  the  end  of  the  1970s  or 
beginning  of  the  1980s.  In  particular,  the  evidence  shows  that  the  decade  of 
the  1970s  was  characterized  by  conditional  convergence  while  the  decade  of 
the  1980s  exhibited  conditional  divergence.  This  finding  is  inconsistent  with 
the  linear  growth  model,  but  not  with  some  versions  of  the  Ramsey  model. 
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Appendix  1:  Data  Selection  Criteria 

We  extract  data  from  21  cross-sectional  family  data  files,  starting  from 
interview  year  1971  (which  contedns  data  on  income  in  1970)  through  1991. 
Prom  each  of  these  we  extract  the  variables  total  family  money  income,  the 
family  sampling  weight,  and  the  family  identity  niunber  for  that  year.  Prom 
the  1971,  1981  and  1991  files  we  also  extract  family  size  £ind  the  variable 
occupation  of  head,  that  allows  us  to  determine  whether  the  household  head 
was  in  the  labor  force  dmring  the  sample  period  (values  of  0  or  missing  are 
treated  as  out  of  the  labor  force).  Prom  the  1971  and  1981  files  we  frirther 
extract  the  age  and  construct  the  sex  (0  male,  1  female),  race  (0  white,  1 
nonwhite)  and  education  (9  dmnmies  corresponding  to  9  Ivels  of  educational 
achievement,  from  iUiterate  to  graduate  school)  dummies  that  we  then  in- 
clude in  vector  X.  To  compute  the  mean  of  the  family  income  distribution 
in  a  given  year  (in  order  to  construct  the  relative  income  measure)  we  use 
PSID  sample  weights.  To  Unk  families  across  time,  we  use  the  principle  that 
the  "successor"  family  at  date  t  +  h  of  a,  family  at  date  t  must  have  the 
same  individual  as  household  head.  This  is  implemented  by  extracting  the 
variables  relationship  to  head  and  identification  nimiber  from  the  cross-year 
individual  file,  and  combining  them  with  the  family  data.  If  a  family  does 
not  have  the  same  head  in  t  and  t  +  hitis  dropped  from  the  sample. 

Appendix  2:  A  Test  of  Linearity  and  In  variance 

Equation  (23)  imposes  two  strong  restrictions  on  the  dynamics  of  individ- 
ual (relative)  incomes.  Pirst,  for  a  given  individual,  current  income  depends 
linearly  on  initial  income.  We  refer  to  this  as  the  Unearity  property.  Second, 
the  coefficient  in  this  linear  relationship  is  the  same  across  individuals.  That 
is,  the  quantity  Xi{t,h)  is  a  function  of  aggregate  variables  only,  and  does 
not  depend  on  any  individual  characteristic  of  the  consimier  (i.e.  ttj  and  f3j). 
We  call  this  an  invariance  property.  In  this  appendix  we  present  a  joint  test 
of  these  linesirity  and  invariance  properties.  To  understand  the  logic  of  our 
testing  strategy,  suppose  that  we  divided  the  total  population  of  an  economy 
into  several  income  groups,  say  by  quartiles.  Next,  imagine  that  we  obtain 
separate  estimates  of  Xi{t,h)  for  each  of  these  groups.  The  Unearity  and 
invariance  properties  jointly  imply  that  the  estimated  coefficients  should  be 
the  same  for  all  groups.   Hence,  this  logic  leads  us  to  perform  a  piecewise 
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linear  regression  of  equation  (24),  and  to  test  the  restriction  that  the  slope 
coefficient  is  the  same  across  "pieces" .  Unfortunately,  as  the  sample  size  falls 
the  standaxd  errors  of  our  instrumental  variable  regressions  increase  dramat- 
ically. Suppose  that  we  estimated  equation  (24)  separately  for  four  income 
brackets.  Due  to  high  standard  errors  a  test  that  \i{t,  h)  is  the  same  across 
these  four  groups  would  have  very  low  power.^^  In  other  words,  we  need  an 
approach  involving  more  precisely  estimated  parameters,  so  as  to  give  the 
data  a  chance  of  rejecting  the  null  hypothesis  of  invariance  and  hnearity. 

A  feasible  strategy  is  available  for  the  case  in  which  Pj  =  0  for  every  j. 
Consider  three  dates:  t,  t  +  h  and  t  +  2-h.  Use  equation  (23)  (with  Pj  =  0 
and  adding  oin:  stochastic  error  term  €j(t  +  h))  to  solve  for  ttj.  Substitute 
the  resulting  expression  for  ttj  in  the  t  +  2  ■  h  version  of  equation  (23).  The 
result  is 

yf{t  +  2  •  /i)  =  ai  •  yf  (<  +  h)  +  a2  ■  yf{t)  +  ej{t  +  2-h)  (Al) 

where 

„.  =  if^l(^        „,.A,(t,2.fe)-A,(t,ft)    1-M*.2") 


and 


1-Xi{t,h)  ^        '"        '       '"    '      l-Ai(f,/i) 


e,(t  +  2.h)  =  e,{t  +  2-h)-  L^^^.,.^t  +  h) 


We  thus  have  a  reduced  form  which  is  entirely  in  terms  of  observed  variables 
(income  in  three  time  periods).  In  addition,  the  two  reduced-form  parameters 
Qi  and  02  depend  imiquely  on  the  structural-form  parameters  X{t,  2  •  h)  and 
X{t,h).  Hence,  holding  t,  t  +  h  and  t  +  2  •  h  constant,  ai  and  0:2  can  be 
estimated  with  standard  cross-  sectional  techniques.  We  implement  this  with 
to  =  1970  ,  ti  =  1980  and  ti  =  1990.2^ 


■^^ Indeed,  when  we  perfonn  the  piecewise  linear  regression  exercise  described  above  we 
can  never  reject  the  hypothesis  of  linearity  and  invariance. 

^^Estimation  of  equation  (Al)  also  requires  the  use  of  instrumental  variables.  The  reason 
is  that  the  error  term  ej{t  +  2  •  h)  is  -  by  construction  -  correlated  with  the  regressor 
Vj  {t  +  h)  (income  in  1980).  We  propose  to  use  the  age,  sex,  race  and  education  of  the 
household  head  as  instruments.  Clearly  in  this  case  the  correlation  between  instruments 
and  explanatory  variables  is  much  stronger  th£in  in  the  previous  subsection.  Hence,  this 
approach  delivers  much  more  tightly  estimated  parameters. 
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Unfortunately,  the  system  of  two  equations  in  two  unknowns  that  defines 
the  structural-form  parameters  in  terms  of  the  reduced-form  ones  is  hnearly 
dependent,  and  cannot  be  solved  for  Ai(1970,20)  and  Ai(1970,10).  Hence, 
we  cannot  apply  the  Unearity  and  invariance  test  directly  to  these  structural 
parameters.  However,  it  is  clear  that  if  the  structured-  form  parameters  are 
the  same  for  every  individual,  so  are  the  reduced-form  ones.  Hence,  a  test 
of  the  linearity  and  invariance  properties  can  be  performed  by  estimating  ai 
and  0:2  in  piecewise  hnear  fashion,  and  asking  whether  these  coeflBcients  are 
stable  across  groups. 

Table  3:  Estimates  of  Equation  (Al) 


Sample 

Qi 

0:2 

Observ. 

aU  sample 

1.268 
(.061) 

-.249 
(.054) 

1955 

group  1 

1.349 
(.269) 

-.418 
(.469) 

489 

group  2 

1.877 
(.370) 

-1.096 
(.523) 

486 

group  3 

1.612 
(.276) 

-.719 

(.328) 

493 

group  4 

1.183 
(.124) 

-.176 
(.094) 

487 

Note:  aj  and  0:2  are  generalized  instrumental  variable  estimates  of  equation  (Al)  (standard 
errors  in  parenthesis).  Dependent  variable  is  y90.  Explanatory  variables  axe  y80  and  y70. 
Instrimients  are  yTO,  sex,  race,  educTO  and  age70.  (See  Table  1  for  data  definitions). 
Group  1  has  families  with  income  below  the  first  quartile.  Group  2  between  first  quartile 
and  median.  Group  3  between  median  and  third  quartile.  Group  4  above  third  quartile. 

Table  3  reports  the  estimation  results  for  the  sample  as  a  whole  (first 
block),  and  then  for  a  piecewise  Uneax  regression  over  four  income  groups 
(second  to  fifth  block):  famihes  with  1970  income  below  the  first  quartile, 
between  the  first  quartile  and  the  median,  between  the  median  and  the  third 
quartile,  above  the  third  quartile.  The  point  estimates  appear  fairly  stable 
and  close  to  those  for  the  whole  sample.  A  possible  exception  is  group  2, 
which,  however,  is  also  the  one  for  which  the  estimates  are  least  precise. 
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In  any  case,  no  systematic  pattern  of  variation  of  the  estimates  among  the 
groups  is  discernible. 

The  formal  test  of  the  hypothesis  that  ai  and  qq  are  equal  across  the  four 
income  groups  is  reported  in  Table  4.  For  each  pair  of  income  groups,  we 
have  computed  a  Wald-test  for  the  hypothesis  that  the  estimated  coefficients 
are  the  same.^°  The  table  reports  the  corresponding  p-values.  Not  once  the 
null  hypothesis  of  invariance  is  rejected,  at  any  standard  confidence  level. 

Table  4:  Wald  test  for  income  groups  (p>-values) 


, 

0.3544  0.7062 

0.8487 

0.3544 

0.8207 

0.2047 

0.7062 

0.8207 

0.2476 

0.8487 

0.2047  0.2476 

. 

Note:  Element  i  j  is  the  p-value  associated  to  the  Wald-test  statistic  for  the  coeflScients  of 
income  groups  i  and  j. 

How  robust  are  these  results  to  other  subdivisions  of  the  sample?  When 
we  perform  the  test  on  two  groups  (below  and  above  the  median  in  1970) 
we  cannot  reject  at  any  standard  confidence  interval.  When  we  use  three 
groups  we  cannot  reject  for  any  pairwise  comparison  at  the  1%  confidence 
level,  but  we  reject  at  the  5%  level  for  one  of  the  three  comparisons.  Finally, 
when  we  use  five  groups  we  again  can  never  reject  our  null  of  linearity  and 
invariance,  at  standard  confidence  levels.  For  each  of  these  subdivisions  we 
have  also  performed  a  test  of  invariance  over  all  groups  simultaneously,  as 
opposed  to  separate  pairwise  tests  (the  two  approaches  coincide  when  there 
are  two  groups  only).  The  lowest  p-value  obtains  in  the  case  of  three  groups, 
and  it  is  0.6876.  The  reader  can  draw  his/her  own  conclusions. 


^°The  test  statistic  is  [a*  —oP]'Q~^{a'  —  oc>],  where  a*  =  [qi  Q2]'  as  estimated  for  income 
group  2  and  Q  is  an  estimate  of  the  variance  covariance  matrix  of  a^  —  ct'.  This  test  has 
the  advantage  that  it  does  not  require  homoscedasticity  across  income  groups  (hence  it  is 
better  suited  to  our  purposes  than  a  Chow  test).  Notice,  however,  that  this  test  is  known 
for  over-rejecting.  Hence,  in  selecting  it  we  set  the  odds  against  our  null. 
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Figure  4:  Conditional  Convergence,  1970-1990 
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Y-axis:  point  estimates  and  95%  confidence  bounds  of  LAMBDA(1970,h).  X-axis:  h. 
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